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Richter syndrome (RS) derives from the rare transformation of chronic lymphocytic 
leukemia (CLL) into an aggressive lymphoma, most commonly of the diffuse large B cell 
lymphoma (DLBCL) type. The molecular pathogenesis of RS is only partially understood. 
By combining whole-exome sequencing and copy-number analysis of 9 CLL-RS pairs and of 
an extended panel of 43 RS cases, we show that this aggressive disease typically arises 
from the predominant CLL clone by acquiring an average of '^20 genetic lesions/case. 
RS lesions are heterogeneous in terms of load and spectrum among patients, and include 
those involved in CLL progression and chemorefractoriness [TP53 disruption and N0TCH1 
activation) as well as some not previously implicated in CLL or RS pathogenesis. In particu- 
lar, disruption of the CDKN2A/B cell cycle regulator is associated with ^30% of RS cases. 
Finally, we report that the genomic landscape of RS is significantly different from that of 
de novo DLBCL, suggesting that they represent distinct disease entities. These results pro- 
vide insights into RS pathogenesis, and identify dysregulated pathways of potential diag- 
nostic and therapeutic relevance. 



Richter syndrome (RS) is defined as the devel- 
opment of a lymphoma, most commonly of the 
diffuse large B cell lymphoma (DLBCL) type, 
during the clinical progression of B cell chronic 
lymphocytic leukemia (CLL; Swerdlow, 2008; 
Rossi and Gaidano, 2009). This event occurs 
in the natural history of CLL with a cumulative 
incidence of ^^10% (Rossi and Gaidano, 2009). 
Analysis of immunoglobulin (Ig) gene rear- 
rangements showed that RS derives from the 
same B cell that originated the CLL clone. De- 
spite being classified histologically as DLBCL, 
clonaUy-related RS is characterized by a much 
poorer outcome, as most of the patients do not 
respond to standard therapy and succumb rap- 
idly to the disease (Tsiniberidou et al., 2008; 
Rossi etal., 2011b). 

The molecular pathogenesis of RS is only 
partially understood. Inactivation of the TP53 
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tumor suppressor gene is obsei-ved in ^60% of 
patients and represents an independent predic- 
tor of overall survival after RS transformation 
(Rossi et al., 201 lb). The second most frequent 
genetic lesions are represented by NOTCH! 
and MYC activating events, each occurring in 
about ^30% of cases (Fabbri et al., 2011; Rossi 
et al., 2011b). Similar to other examples of evo- 
lution of mature B cell— derived tumors from 
indolent to aggressive disease, MYC-activating 
events often coexist with TP53 disruption 
in RS (Yano et al, 1992; Davies et al, 2007), 
whereas they distribute in a mutually exclusive 
fashion with NOTCHl mutations (Fabbri 
et al., 2011), consistent with the notion that 
NOTCHl is a direct transcriptional activator 
of MYC expression (Palomero et al., 2006). 
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These genetic abnormalities are often acquired at transforma- 
tion, suggesting their importance for the estabhshment and 
maintenance of the phenotype characteristic of RS (Fabbri 
et al, 2011; Rossi et al., 2011b). 

The known RS-associated genetic lesions have been identi- 
fied based on candidate gene approaches, do not account for aU 
cases, and most likely represent only a fraction of those associ- 
ated with RS pathogenesis. Conversely, unbiased, genome- 
wide methods aimed at the characterization of the entire 
spectrum of genetic lesions that are acquired by RS can be use- 
fiol to decipher the clonal evolution pattern occurring in the 
CLL-RS transition process, to provide further insights into its 
pathogenesis, and to identify possible differences between RS- 
associated DLBCL and de novo DLBCL (Morin et al., 2011; 
Pasqualucci et al., 2011; Lohr et al, 2012; Zhang et al., 2013). 
Toward this end, we have adopted an integrated approach based 
on next generation whole-exome sequencing (WES) and 
genome-wide high-density single-nucleotide polymorphism 
(SNP) analysis to investigate the RS coding genome. 

We report that most RS cases derive from the same clone 
as CLL cells through a linear evolution pattern involving the 
acquisition of novel genetic lesions. These lesions are heteroge- 
neous in number and spectrum among RS cases, and include 
genes not previously associated with RS transformation. Fi- 
nally, our results demonstrate that RS represents a genetically 
distinct entity from de novo DLBCL. 

RESULTS 

Most RS arise through a linear evolution 
pattern from the CLL predominant clone 

Transformation of CLL to RS can occur as a linear process, in 
which RS is a direct descendant of the initial major CLL clone, 
or by branched evolution fr-om a common earher precursor cell 
that acquired distinct genetic lesions to become a CLL or a RS. 
To decipher the clonal evolution pattem of CLL to RS, we 
performed next-generation WES, and copy number (CN) and 
fluorescence in situ hybridization (FISH) analysis for the detec- 
tion of translocations known to be recurrent in RS and/ or 
de novo DLBCL (i.e., MYC, BCL6, and BCL2 translocations) 
in a set of nine paired and clonally related CLL-RS cases. In 
three cases, we also included the specimen corresponding to the 
terminal leukemic phase of RS (RSII), which represents an 
extremely rare and aggressive phenotype of the disease, charac- 
terized by severe cytopenia and leukemic dissemination of large 
B cells within the peripheral blood, and invariably associated 
with patients' death (Table SI; Rossi and Gaidano, 2009). Ad- 
ditionally, an independent set of six paired CLL-RS cases 
with only CN data available was included in the analysis (total, 
11 = 15 pairs). 

To reconstruct the evolutionary history of CLL transforma- 
tion, we assessed separately the load of genetic lesions shared 
between the two phases of the disease, those present in CLL and 
absent in RS, and those present in RS but not in CLL (Fig. 1). 
This analysis allowed us to discriminate between a linear pattern 
of evolution, in which all CLL lesions are maintained in the RS 
specimen together with additional RS-specific abnormalities, 
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and a branched pattern of evolution, in which CLL- and RS- 
specific lesions are present in addition to the set of shared ab- 
normaUties. Due to the unavailability of paired normal DNAs, 
mutations present in both CLL and RS were counted only for 
genes known to be mutated in CLL (i.e., NOTCHl, SF3B1, 
TPS 3, ATM, and MYD88). Genetic lesions that were only ob- 
served in the CLL phase because of CN losses or copy neutral- 
LOH in RS were not computed in the analysis. 

In the majority of RS patients {n = 12/15), all the abnor- 
mahties identified in the CLL phase of the disease were main- 
tained in the RS phase, which, in turn, acquired a variable 
number of genetic lesions. This scenario is consistent with a 
linear pattern of evolution from CLL to RS, as observed for 
other tumor progression states (Walter et al., 2012; Aparicio 
and Caldas, 2013) and for a subset of progressed CLLs (Schuh 
et al, 2012; Landau et al., 2013; Ouillette et al, 2013; Fig. 1; 
Table S2 and Table S3). In one patient (case 7), a subclonal 
13ql4 deletion detected by FISH in 9% of the nuclei of CLL 
cells (Table SI) was not found in the paired RS, suggesting its 
presence in a minor clone that was not selected for subsequent 
transformation, and consistent with the notion that absence 
of dell3ql4 is associated with a higher risk of transformation 
(Rossi etal.,2008). 

In the remaining three patients (20%) , a subset of the le- 
sions identified in the CLL clone was not detectable in the RS 
specimen. In particular, case 5 harbored one point mutation in 
the CLL phase that was absent both in the corresponding RS 
sample and in the final leukemic phase; in case 4, four point 
mutations were present in CLL at diagnosis and in the termi- 
nal leukemic phase, but not in the RS specimen (Fig. 1 and 
Table S2) ; in the third patient, characterized only by CN analy- 
sis (case 25), ^35% of CLL lesions {n = 8/23) were not main- 
tained in the RS diagnostic sample (Table S3). In all three 
cases, a variable number of genetic lesions were uniquely pres- 
ent in RS cells. These scenarios are consistent with a branching 
pattern of evolution, in which a putative precursor carrying 
the genetic lesions common to CLL and RS has then evolved 
toward CLL and RS through the acquisition of distinct and 
independent events. 

To address the evolutionary pattern followed in the pro- 
gression of RS from nodal to leukemic disease, we next fo- 
cused on the set of lesions present either in the RS or in the 
RSII phase. Notably, whereas case 2 progressed through a lin- 
ear evolution pattern, in both case 4 and case 5 the RS and its 
terminal leukemic phase acquired a totally or partially distinct 
set of genetic lesions, suggesting their progression through a 
branching pattern of clonal evolution (Figs. 1—3). In case 4, 
biaUelic TP5d disruption was acquired in both phases of RS, 
but this event clearly occurred in different tumor subclones, as 
documented by the different size of the deletion in 17p and by 
the distinct TP53 mutation observed in RS and RSII (R213X 
and R248Q, respectively; see Fig. 3, a and b). In case 5, multiple 
genetic lesions ultimately leading to NOTCH signaling activa- 
tion were acquired during CLL transformation and subsequent 
RS progression (Fig. 3, c and d). In particular, a NOTCHl 
PEST-truncating mutation (P2514fs) was present in the CLL 
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Figure 1 . Dynamics of genetic lesions associated with the transformation of CLL to RS. Histograms representing the number of genetic lesions: 
(a) shared between the 9 CLL and RS discovery pairs (data based on CN and FISH analyses and sequencing of NOTCHl, TP53, and SF3B1]; (b) clonally rep- 
resented in CLL and absent in the paired RS (data based on CN, FISH and WES analyses); and (c) acquired in the CLL-RS transition (data based on CN, FISH 
and WES analyses). In three cases (on the right portion of the graph), the terminal leukemic phase of RS (RSII) was included in the analysis (cases 2, 4, and 5). 
In case 5, the acquired lesions are based on WES and FISH data; none of the discovery cases carried mutations shared between CLL and RS in the ATM and 
MYD88 genes. Tx, translocation; M, mutation. 



phase and in all following RS biopsies; a homozygous frame- 
shift deletion in the SPEN gene (P1356fs), a putative negative 
regulator of NOTCH signaling targeted by inactivating muta- 
tions in splenic marginal zone lymphoma (Kuroda et al., 2003; 
Saito et al., 2003; Rossi et al., 2012c), was acquired at RS diag- 
nosis and maintained in the leukemic phase of the disease; and 
during the progression from nodal localization (RS) to periph- 
eral blood invasion (RSII), a second NOTCHl mutation 
targeting the heterodimerization domain of the protein was 
acquired (V1721M; Fig. 3, c and d).The detection of PEST and 
heterodimerization domain mutations in the same patient is of 
relevance, as mutations in these two distinct domains have been 
shown to act synergisticaUy in promoting signaHng activation 
(Weng etal.,2004). 

Overall, these patterns of somatic alterations are consistent 
with a predominandy linear evolutionary model, in which RS 
evolves through the addition of novel genetic lesions to those 
already present in the predominant CLL clone. In contrast, the 
blood localization of the disease seems to be often the result of 
a branching pattern of evolution, consistent with the selection 
and expansion of an ancestral precursor clone that was present 
in the patients at an earlier stage than the time of the initial 
diagnostic RS biopsy (Fig. 2) . 
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RS acquires a heterogeneous number 
and spectrum of genetic lesions 

To assess the load and the spectrum of genetic lesions acquired 
in the transition of CLL to RS, we next focused on the set 
of genetic aberrations present exclusively in the RS coding 
genome. WES analysis in the 9 CLL-RS pairs revealed the 
presence of 241 nonsUent mutations specific for RS, either at 
diagnosis (n = 165, range 0—118 events/ case) or in the terminal 
leukemic phase of the disease (;; = 76, range 0—36 events/case; 
Table S2; Fig. 1 c). A subset of these mutations {n = 13/241, 
5.4%) was already detectable in a small fraction of the CLL se- 
quencing reads (<20%), suggesting their presence at the sub- 
clonal level early during CLL development, and subsequent 
selection and outgrowth during transformation. Analogously, 
CN analysis of the same cases, and of the 6 pairs with only 
CN data available, revealed the presence of 138 CNAs 
specific for RS at diagnosis (n = 95, range 0— 29/case) or 
for the terminal leukemic phase {ii = 43, range 14—29; 
Table S3; Fig. 1 c). 

The combination of WES, CN, and FISH data obtained 
in the 9 pairs for which the complete set of analysis was avail- 
able showed that the RS genome acquires a mean of ^22 
genetic lesions/case, with a great variabihty across individual 
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Figure 2. Patterns of genetic evolution of CLL to RS. In each case the analyzed CLL clone is represented by a purple cell, the RS clone by a blue cell, 
and the RSII clone by a green cell (available in cases 2-4-5). Inferred clones are represented in orange. Bold black arrows depict the clonal path between 
samples and the time elapsed between samples is highlighted next to the gray lines interconnecting the analyzed time points. Each therapeutic line 
administered to patients is represented by a red star. The number of genetic lesions acquired in the CLL-RS transition is highlighted below the RS or RSII 
clone, respectively. TP, time-point; yr, years; Mo, months. 



samples (range 0— 130/case; Fig. 1 c). The only case (8-RS) in 
which we did not detect any RS-specific genetic lesion may 
have acquired genetic/epigenetic aberrations not assessable 
with our integrated approach (e.g., translocations, mutations in 
noncoding regulatory portions of the genome, or aberrant 
methylation; Kiippers and DaUa-Favera, 2001; KuHs et al., 
2012; Huang et al., 2013). The lack of acquisition of lesions in 
this patient was not caused by low tumor representation, as 
documented by the presence of a set of abnormalities, including 
a TP53 mutation, which were shared by the CLL and RS phase 
(Fig. 1 a). The load of alterations acquired by the teiminal leu- 
kemic phase of the disease was higher compared with the one 
observed in the CLL-RS transition, with ^40 acquired lesions 
per case on average (Fig. 1 c) . 

Although the number of cases studied is Umited, no signifi- 
cant difference in the load of acquired genetic lesions was ob- 
served between IQHV mutated or unmutated RS or in cases 
with different cHnical stages of the disease (Fig. 1; Table SI). 
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A detailed hst of genes altered in RS in the discovery panel 
is reported in Tables S2 and S3, with notes on their possible 
functional significance. 

Overall, these data show that the genetic lesions acquired in 
the transition of CLL to RS are quite heterogeneous in ternis 
of load and spectrum across samples. 

Recurrent targets of CN changes in RS: 
frequent loss of TP53 and CDKN2A/B 

One way to determine the relevance of newly identified ge- 
netic lesions is to assess their recurrence in the disease. Toward 
this end, we first extended the CN analysis to a larger cohort of 
RS {n = 42 cases). Consistent with the results obtained in the 
discovery panel, this analysis confirmed the high degree of het- 
erogeneity of the RS genome, with on average ^12.5 CNAs 
per case and a range of 0—55 aberrations per sample, mosdy 
represented by CN losses, which accounted for ^60% of the 
identified lesions (Fig. 4 a). 
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Figure 3. Representative examples of clonal evolution of genetic lesions during CLL transformation to RS and subsequent RS progression. 

(a and b) Different tumor clones acquire biallelic TP53 disruption in RS and RSI! in case 4. (a) Graphs reporting the variants' adjusted frequencies as iden- 
tified by WES analysis in the TP53 gene in the CLL, RS, and RSII phases of case 4. (b) Heatmap showing median-smoothed log2 CN ratio of the short arm 
of chromosome 1 7 in the CLL, RS, and RSII phases of case 4. (c and d) Progressive accumulation of multiple genetic lesions leading to NOTCH signaling 
activation in case 5. (c) Graphs reporting the variants' adjusted frequencies as identified by WES analysis in the NOTCH! and SPfW genes in the CLL, RS, 
and RSII phases of case 5. (d) Schematic of mutations affecting NOTCH! and SPfW in the CLL, RS, and RSII phases of case 5. 



We identified 107 focal minimal common regions (MCRs) 
of aberration (84 of loss and 23 of gain) encompassing 1—3 
protein-coding genes, which most hkely represent the target of 
the lesion (Fig. 4 b). The relevance of the identified lesions was 
confirmed through GISTIC, an algorithm based on the ampli- 
tude and frequency of occurrence of CN changes (Beroukhim 
et al., 2007; Fig. 4 d). To address the specificity of these newly 
identified recurrent regions of CN aberration for RS, we com- 
pared their frequency with that reported in a cohort of 353 newly 
diagnosed and previously untreated CLL cases (Edeknann et al., 
2012), focusing on genes altered in >10% of RS cases as well as 
on known recurrent cytogenetic abnormalities (Edelmann et al., 
2012; Fig. 4, b and c). 

Consistent with previous reports (Rossi et al., 2011b), the 
most frequent aberration significantly enriched in RS was rep- 
resented by dell7p, detected in ^40% of RS cases and in the 
majority of the cases involving the TP 5 3 tumor suppressor 
gene {ii = 18/20). Notably, the overall number of genetic 



lesions was significantly higher in TPJi-disrupted cases com- 
pared with TP53 wild-type cases, consistent with an increased 
genomic instability caused by this genetic abnormaUty (P < 
0.001; Fig. 4 e; Ouillette et al, 2010). Moreover, four TP53 
disrupted cases displayed complex intrachroniosonial rear- 
rangements caused by alternating gains and losses of genetic 
material, frequently accounting for >10 switches per chromo- 
some, possibly suggestive of chroniothripsis (unpublished 
data). A preferential association of this phenomenon with TPS 3 
lesions has been reported in CLL (Edelmann et al., 2012), as 
well as in other tumor types such as meduUoblastoma (Rausch 
et al, 2012). 

Our analysis identified del9p21 as a frequent genetic aber- 
ration commonly detectable in RS (^30% of cases) and shown 
to be specifically acquired at transformation in all cases in 
which the CLL phase was available {n = 3/3; Fig. 5, a and b). 
Consistently, del9p21 was never observed in unselected CLL 
cases (Edelmann et al., 2012). In seven out of the 15 affected 
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Figure 4. Overall picture of CNAs In the RS coding genome, (a) Overall number and frequency of CNAs in RS. (b) Minimal common regions (MCRs) 
of CN loss (top) or CN gain (bottom) encompassing 1-3 protein-coding genes and altered in >10°/o of RS cases {n = 50; blue and red bars], ranked by 
chromosome and frequency of alteration. Gray bars represent the frequency of loss (top) or gain (bottom) affecting the genes included in the annotated 
MCR in a series of 353 CLL cases as reported in Edelmann et al. (2012). All reported MCRs occurred with a significantly different frequency between CLL 
and RS based on a two-tailed Fisher's exact test (all with P < 0.001). (c) Frequency of known CLL-associated cytogenetic abnormalities in RS [n = 50 
cases, black bars) and in a series of 353 CLL cases as reported in Edelmann et al., 2012 (gray bars). Asterisks indicate lesions present with different fre- 
quency between CLL and RS based on a two-tailed Fisher's exact test (*, P < 0.05; **, P < 0.001). (d) Statistical analysis of CN losses (left, blue) and gains 
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of CNAs in 50 RS cases (all or grouped according to the presence or absence of TP53 disruption). **, P < 0.001 according to a nonparametric two-tailed 
Student's f test. 



cases (46.7%), the loss of genetic material encompassed <10 
genes, always including the CDKN2A/B locus. Disraption of 
CDKN2A/B was biaUeUc in over one third of cases (/; =6/15); 
in 3 of them, this was a result of reduplication of 1 allele 
carrying a hemizygous CDKN2A/B deletion, as assessed by 
LOH analysis, whereas one additional case harbored a hetero- 
zygous deletion with a CDKN2A sphce site mutation in the 
second allele (Fig. 5 c and Table S2). One patient (16-RS) 
harbored a focal CN loss encompassing only the CDKN2B 
gene, which encodes for pl5INK4B, a regulator of cell cycle 
arrest that is potently induced by TGF-(3 signaling (Hannon 
and Beach, 1994). Collectively, these results suggest that the 



CDKN2A/B locus is the specific target of the 9p21 deletions, 
which is consistent with observations in other tumor types 
(Sherr, 2004). 

CDKN2A encodes both for the negative regulator of cell 
cycle pl6INK4A (Serrano et al., 1993) and for ARF, an in- 
hibitor of the negative regulator of p53 MDM2 (Zhang et al., 
1998). We thus sought to analyze the distribution of aberra- 
tions involving this locus with other lesions known to affect 
key regulators of cell cycle control and DNA repair (Fig. 5 d). 
This analysis revealed that CDKN2A losses, TP53 disruption, 
and MYC-activating events often coexist. Interestingly, all 
these lesions were mutually exclusive with trisomy 12, another 
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Figure 5. Loss of 9p is commonly acquired in RS. (a) dChip inferred CN Ineat map sinowing tine magnification of deletions affecting the CDKN2A/B 
locus on chromosome 9 in the RS samples carrying the loss (three with paired CLL). The first three columns represent normal (diploid) samples. In the red- 
blue scale, white corresponds to a normal (diploid) CN log2-ratio, blue to deletion, and red to gain, (b) Percentage of RS samples harboring CDKN2A/B 
deletions, (c) Regions of copy-neutral LOH affecting the CDKN2A/B locus. The left-side shows the inferred probability of LOH for each case, which is dis- 
played from blue (1) to white (0.5) to yellow (0), and the right side shows a heatmap of the median-smoothed log2 CN ratio of the locus (blue is deletion 
and red is gain), (d) In the heatmap, rows correspond to the indicated aberrations and columns represent individual RS samples, color-coded based on the 
gene status (light grey wild-type; dark grey, altered), (e) Significant associations between genetic lesions are listed with their corresponding p-value. Prob, 
probability; N, mutual exclusion; P, positive association. 



recurrent alteration observed in '^30% of RS cases (P < 0.05). 
In addition, TP53 disruption was significantly (P < 0.05) asso- 
ciated with dell3ql4 and with RBI disruption (Fig. 5 e). 



Several additional CN changes occurred at higher fre- 
quency in RS compared with CLL, including trisomy 12, 
losses of the 7q31.31-36.3, 8p and 14q23.2-q32.33 loci, and 
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high level amplifications affecting the long arm of chromosomes 
8, 11, 13, and 18. The putative targets of these lesions included 
genes previously linked to cancer, such as POTl, a gene in- 
volved in telomeres protection, which was reported to harbor 
mutations in '^3% of CLL patients and also found to be mu- 
tated in 2 of our 27 RS (Table S2; Bauniann and Cech, 2001; 
Quesada et al., 2012; Landau et al., 2013; Ramsay et al., 2013); 
TRAF3, a negative regulator of noncanonical NF-kB that is 
fi-equently inactivated in multiple myeloma (Annunziata et al., 
2007; Keats et al., 2007); and the oncogenes MYC, ETSi, 
MIR17HG, and BCL2 (Vaux et al., 1988; Bories et al., 1995; 
He et al., 2005; Xiao et al., 2008). No point mutations were 
found in the aforementioned genes. 

Among the other CNAs that are recurrent in CLL patients, 
delllq and gains of the 2pl6.1-2pl5 locus occurred in RS at 
a frequency comparable to that of unselected CLLs (14 and 
8% vs. 27 and 7%, respectively; Fig. 4 c; Edeknann et al., 2012). 
In contrast, the frequency of dell3ql4 was significandy lower 
in RS compared to CLL cases at diagnosis ('^28 vs. 61%; P < 
0.001), consistent with the observation that its absence is an in- 
dependent risk factor for RS development and that this lesion, 
known to be an early event in CLL onset, was never acquired 
at RS (Edelniann et al., 2012; Rossi et al., 2008). 

Collectively, these results confirm the high degree of het- 
erogeneity of the RS genome, point to relevant genes and loci 
not previously hnked to RS, and provide a large array of novel 
candidates for mutational analysis. 

Recurrent targets of point mutations in RS 

To identify recureent targets of point mutations in RS, we per- 
formed WES analysis in a "screening panel" of 18 RS cases. 
Due to the unavailability of paired nomial DNAs, the mutation 
frequency was only assessed on those genes/ mutations for which 
previous information of functional relevance was available, in- 
cluding: (a) genes identified by WES in the discovery panel 
(tt = 231); (b) genes located in the focal MCR of an aberration 
with a frequency >10% (/( = 54); and (c) genes mutated in 
de novo DLBCL and CLL, as identified in recently published 
genome-wide studies (Fabbri et al., 2011; Morin et al., 2011; 
Pasqualucci et al., 2011; Puente et al., 2011; Wang et al, 2011; 
Lohr et al, 2012; Quesada et al., 2012; Landau et al., 2013) and 
listed in the Cancer Gene Census or for which mutations have 
been implicated functionally in cancer (n =215). Using these 
criteria, we interrogated a total number of 481 genes. Fig. 6 
shows the prevalence (including discovery and screening panels) 
of RS cases with mutations in genes altered with a fre- 
quency >10% (k = 23). 

To address the specificity of the newly identified recur- 
rent mutations for RS, we compared their frequency with 
the one previously reported in whole-exome/genome stud- 
ies performed in CLL patients (n = 205; Fabbri et al., 2011; 
Puente et al., 2011; Quesada et al, 2012; Wang et al, 2011). 
Indeed, with the exception of three genes previously linked 
to CLL (i.e., MYD88, SF3B1, and ATM; Stankovic et al, 
1999; Wang et al, 2011; Puente et al, 2011; Quesada et al, 
2012), 20/23 mutated genes were altered at a significantly 

2280 



higher frequency in RS (p-values in Fig. 6) compared with 
CLL. This finding is consistent with the notion that MYD88 
mutations seem to arise early in the natural history of CLL 
(Landau et al., 2013) and with the reported enrichment of 
SF3B1 mutations, which often coexist with ATM disruption 
(Wang et al., 2011) in CLL cases refractory to fludarabine- 
based therapy compared with CLLs at diagnosis and RS patients 
(Rossi et al., 2011a). 

Genes affected in >10% of RS patients were representative 
of a variety of biological programs, including DNA repair 
{TPS 3 and ATM; Stankovic et al, 1999; Zenz et al., 2010), 
NOTCH signaling pathway {NOTCHl and SPEN; Fabbri 
et al., 2011; Puente et al., 2011; Rossi et al., 2012c; Saito 
et al., 2003), transcription and chromatin remodeling {ZFHX3, 
ARIDIA, ARIDIB, MLL2, and IRX5; Sun et al, 2005; Grasso 
et al., 2012; Wu and Roberts, 2013), B cell development/ acti- 
vation {IRF8, PIMi, and MYD88; Donien et al, 1993; Wang 
and Morse, 2009; Rawlings et al. , 2012) , PI3K pathway (PREX2; 
Berger et al., 2012; Fine et al., 2009), cytoskeleton/extraceUular 
matrix organization and adhesion {PCLO, COL27A1, COL2A1, 
Sl^Pl, and SYNE2), ubiquitin proteolysis {USP8, USP34; 
Berlin et al., 2010; Lui et al., 2011), and splicing regulation 
{SF3B1; Golas et al., 2003; Fig. 6). A detailed description of the 
identified mutations is reported in Table S2, with remarks on 
their possible functional consequence. Overall, these data point 
to specific oncogenic biological programs and pathways in- 
volved in the pathogenesis of the disease, as described in the 
following section. 

Cellular programs commonly altered in RS 

To organize the spectrum of genetic lesions in cellular path- 
ways relevant for RS pathogenesis, we performed pathways/ 
gene functional clustering analysis using an integrated approach 
based on the combination of publicly available tools (i.e., 
GSEA and DAVID 2008; Mootha et al., 2003; Subramanian 
et al., 2005; Huang et al., 2009a,b) and manual annotation of 
the genes altered in RS, including (a) RS-WES mutated genes; 
(b) genes in focal MCR of aberration with a frequency >10%; 
and (c) "de novo DLBCL/CLL CGC genes" mutated in at 
least 2 RS cases (total ;/ = 304 genes). 

All used tools reported proliferation-, apoptosis-, and cell- 
cycle regulation in the top statistically significant categories, 
suggesting that deregulation of these processes is the driving 
force of this highly aggressive tumor that combines in a single 
disease the effects of chemoresistance and rapid disease kinet- 
ics. Based on our integrated analysis, the four genes most com- 
monly altered m RS were TP53, NOTCH!, MYC, and 
CDKN2A/B, which are well-characterized regulators of tumor 
suppression, cell proliferation, and cell cycle control, whose le- 
sions overall accounted for '^90% of RS cases (Figs. 6—8). Ad- 
ditional programs and pathways that emerged as significantly 
altered in RS included the MAPK, Wnt, NF-kB, and TGF-(3 
pathways, RNA processing/degradation, and protein transla- 
tion processes. 

Collectively, these results suggest that the transformation 
of CLL to RS involves a heterogeneous set of pathways in 
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Figure 6. Genes recurrently mutated in RS. Percentage of RS cases [n = 27) harboring mutations in genes altered witin a frequency >10'¥o (red bars). 
The mutation frequency of these genes in CLL (gray bar) was obtained in a dataset of 205 CLL cases analyzed for whole-exome/whole genome sequencing 
(see main text). Asterisks indicate genes differentially mutated between CLL and RS based on a two-tailed Fisher's exact test (*, P s 0.05; **, P < 0.001). 
Black triangles denote genes reported in the Cancer Gene Census database (http://www.sanger.ac.uk/genetics/CGP/Census/); green triangles indicate puta- 
tive or proven tumor suppressor genes; and red triangles denote putative or proven oncogenes. TSG, tumor-suppressor gene; ns, not significant. 



different cases, with those involving apoptosis and cell prolif- 
eration representing the most consistent ones. 

The coding genome of RS is distinct 
from the one of de novo DLBCL 

Although most RS cases are histologically classified as DLBCL, 
the response to treatment is significantly poorer in RS than in 
de novo DLBCL, even when considering the less curable acti- 
vated B cell— Uke subtype (Coiffier et al., 2002; Tsimberidou 
et al., 2006; Tsimberidou et al., 2008), suggesting a distinct un- 
derlying pathogenesis. To obtain a comparative assessment of 
the genetic landscape of these two diseases, we compared the 
frequency of the most prevalent genetic lesions in 27 RS cases, 
characterized for WES and CN data (see above), and in 71 
newly diagnosed and previously untreated DLBCL cases, with 
available CN and targeted resequencing data of genes com- 
monly mutated in DLBCL (Pasqualucci et al., 2011). We sepa- 
rately compared the frequency of genetic lesions in RS versus 
the two main subtypes of DLBCL, i.e., activated B cell— like 
(ABC, n = 40) and germinal center B cell like (GCB, 11 = 31) 
DLBCL (Fig. 8). For genes for which we lacked mutational 
analysis data, we used a recently pubHshed WES-dataset of 49 
primary DLBCL cases with no information regarding ABC 
versus GCB subtype (Lohr et al., 2012). 

Although almost all RS cases display a nongerminal center 
phenotype (Hans et al., 2004), the genetics of RS significantly 
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differed from that of de novo ABC-DLBCL, with the excep- 
tion of CDKN2A/B losses and point mutations affecting 
MYD88, which were, overall, observed at comparable fre- 
quency. In particular, several lesions typically associated with 
ABC-DLBCL were rare or absent in RS. Significant differ- 
ences included biaUelic inactivation of the TNFAIP3 /A20 
and PRDMl /BLIMPl tumor suppressor genes and BCL6 
translocations, which were never observed in RS, whereas they 
are associated with ^30, ^22, and 37.5% of ABC-DLBCL, 
respectively (P < 0.01; Fig. 8). Additional lesions typically as- 
sociated with ABC-DLBCL, including mutations of PIMi, 
CD79B, and CARDU, as well as BCL2-CN gains, were rel- 
atively rare in RS (11.1 vs. 17.5%, 7.4 vs. 25%, 7.4 vs. 12.5%, 
and 22.2 vs. 42.5%, respectively). 

Among the lesions typically associated with GCB-DLBCL, 
AfYC-activating events presented comparable frequencies in the 
two diseases (Fig. 8), whereas mutations involving CREBBP 
and EZH2, which were observed in 22.6 and 9.7% of GCB- 
DLBCL, respectively, never occurred in RS. The difference 
was statistically significant (P < 0.05) only for CiiEBPP muta- 
tions, most Hkely due to the lower frequency of EZH2 muta- 
tions in GCB-DLBCL. BCL2 translocations, present in ^30% 
of GCB-DLBCL, were never observed in RS, as previously 
reported (Rossi et al., 2011b; P < 0.01). 

Finally, among the lesions equally represented in both 
DLBCL subtypes, some occurred with similar frequency in 
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the two diseases (i.e., MIR17HG gain, 8p loss, and deletion 
ofPTEN). B2Mbiallelic disruption and gains of chromosome 
7 occurred less frequently in RS compared with DLBCL 
(0 vs. 16.9%, and 7.4 vs. 38%, respectively; P < 0.05). The 
frequency of MLL2 and PCLO mutations was also lower in 
RS compared with DLBCL (15 vs. 35% and 11.1 vs. 22.5%), 
but this difference did not reach statistical significance (Fig. 8). 
Strikingly, the two most frequent genetic lesions that charac- 
terize the RS genome, i.e., TP53 biallelic disruption and 
NOTCH1 mutations, were significantly enriched in RS 
compared with both subtypes of DLBCL (P < 0.01), thus 
confirming their specific relevance in the pathogenesis of the 
disease (Fig. 8). 

Altogether, these data underline the fact that, despite the 
morphological and phenotypic similarities between RS and 
de novo DLBCL, the molecular profile of these two condi- 
tions is largely different, suggesting that they represent dis- 
tinct disease entities. 

DISCUSSION 

This study was aimed at elucidating relevant questions regard- 
ing RS pathogenesis, including its pattern of clonal evolution 
from CLL, the genetic determinants of the transformation 
event, and the pathogenetic relationship between RS and clas- 
sical non-CLL associated de novo DLBCL. Although these 
questions have been previously addressed using mostly candi- 
date gene approaches, we have used unbiased whole-exonie 
and CN analysis for a comprehensive definition of these issues. 
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The obtained results provide insights into RS pathogenesis 
and have diagnostic and therapeutic imphcations. 

The clonal evolution of any given tumor type occurs either 
through a linear model, in which the predominant clone ac- 
quires novel genetic lesions leading to progression, or a branch- 
ing model, in which a common ancestor evolves to the initial 
tumor and to the progressed stage through distinct genetic 
pathways (Aparicio and Caldas, 2013). Our results show that 
the CLL-RS transition occurs through a linear model of evolu- 
tion in most cases, where RS represents the final stage of evolu- 
tion of the original founder CLL clone.This result is consistent 
with early observations based on the analysis of intraclonal di- 
versification of 7GHK genes in paired CLL-RS (Rossi et al., 
2012b). In a minor fraction of cases, RS seems to be the result 
of the branching evolution of CLL and RS from an earher 
conmion precursor cell, perhaps consistent with the observa- 
tion that early hematopoietic stem cells from CLL patients can 
acquire propensity to generate clonal B cells (Kikushige et al., 
2011). Both Unear and branching patterns of clonal evolution 
have been described in CLL chnical progression (Schuh et al., 
2012; Landau et al.,2013; OuiUette et al., 2013), as well as in the 
transformation of follicular-lymphoma (FL) to DLBCL (Carlotti 
et al., 2009; and unpublished data), although in the latter case a 
branching pattern of evolution is predominant. This is also the 
case in the rare progression of RS from a nodal to a leukemic 
disease, at least in two out of the three cases analyzed in this 
study. The terminal leukemic phase of RS might be the result 
of the selection and outgrowth of a subclone, perhaps driven by 
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the highly cytotoxic therapeutic regimes used for RS treatment 
(Rossi and Gaidano, 2009). 

Moreover, our data reveal that no single lesion or combi- 
nation of genetic lesions or cellular pathways seem to be 
responsible for CLL transformation to RS, with various RS 
cases differing in both the number and the type of aberra- 
tions. Nonetheless, the fact that ^90% of the cases display 
conibmations of lesions of TP53, NOTCHl, MYC, and CD- 
KN2A/B suggests that RS transformation does not involve 
primarily specific B cell signaling pathways, but rather gen- 
eral regulators of tumor suppression, cell cycle control and 
cell proliferation. Whereas previous reports have already 
identified TP53 inactivation, and the mutually exclusive 
NOTCHl and AfYC-activating events as common in RS 
pathogenesis (Fabbri et al., 2011; Rossi et al., 2011b), our re- 
sults identify loss of the CDKN2A/B locus as present in 
^30% of RS cases and often co-occurring with the two other 
most frequent lesions observed in RS, namely TP53 disrup- 
tion and NOTCHl mutations. The individual and combined 
role of these lesions can be studied further in appropriate 
mouse models. Overall, we note that RS transformation 



appears to involve pathways that are distinct from those ac- 
companying CLL progression and/or acquisition of chemo- 
resistance, the latter being recurrently associated with lesions 
of the SF3B1 gene, encoding a component of the spliceo- 
some, and of the BIRC3 gene, encoding a negative regulator 
of the NF-kB pathway, although largely lacking MYC ac- 
tivation and CDKN2A/B disruption that are common in 
RS (Wang et al, 2011; Quesada et al, 2012; Rossi et aL, 
2011a,b). 

The results herein also indicate that the DLBCL disease 
typical of RS is pathogenetically distinct from both forms 
(GCB and ABC) of de novo DLBCL (Morin et al., 2011; 
Pasqualucci et al., 2011). The most relevant differences involve 
the common activation of NOTCH and inactivation of TP53 
pathways, which are virtually absent or rare in de novo 
DLBCL, respectively. Furthermore, RS-DLBCL lacks lesions 
common in all DLBCL types, such as inactivation of the acetyl- 
transferase genes CREBBP/EP300 and of the B2M gene, as 
well as those conmion in ABC-DLBCL (e.g., translocations of 
BCL6 and loss oi PRDMl/BLIMPi and TNFAIP3/A20) or 
GCB-DLBCL (e.g., translocations of BCL2; Challa-Malladi 
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et al, 2011; Lohr et al., 2012; Morin et al, 2011; Pasqualucci 
et al., 2011). These differences indicate that DLBCL trans- 
formed from CLL and classical DLBCL represent distinct dis- 
ease entities. 

Finally, these findings have diagnostic and therapeutic im- 
plications. The higher frequency of specific lesions in RS com- 
pared with CLL suggests that their early detection can be tested 
for its prognostic significance in predicting CLL evolution 
toward RS. Moreover, the specific presence (e.g., NOTCH! 
mutations) or absence (e.g., inactivation of CREBBP/EP300 
and translocations of BCL2) of defined lesions in RS imply 
distinct approaches in identifying targeted therapies from those 
currently being tested for DLBCL. In particular, two of the 
most common lesions occurring in RS, namely NOTCHl 
mutations and CDKN2A/B deletions, represent established 
therapeutic targets with some drugs being already available for 
testing in single or combinatorial regimens (Monti et al., 2012; 
Samon et al., 2012; Wu et al., 2010). 

MATERIALS AND METHODS 

Primary cases. The discovery panel comprised 21 samples from 9 paired 
CLL-RS cases (9 CLL-RS pairs, plus 3 paired terminal leukemic phases of 
RS). Diagnosis of CLL was based on IWCLL-NCI Working Group criteria 
(Hallek et al., 2008) and confirmed by a flow cytometry score >3, whereas 
diagnosis of RS was based on the histology of lymph node or extranodal tissue 
excisional biopsies. After institutional pathological review, all RS cases were 
classified as DLBCL according to the World Health Organization Classifica- 
tion of Tumors of the Hematopoietic and Lymphoid Tissues (Swerdlow, 
2008). The clinical and biological features of the discoveiy cases are summa- 
rized in Table SI. 

The screening panel used for additional CN analysis was composed of a 
multi-institutional cohort of 42 clonally related RS, 17 of which were also 
characterized by WES. An additional RS case with no CN data available was 
included in the WES-sequencing panel. 

The cases included in this study were provided by the Division of 
Hematology, Department of Translational Medicine, Amedeo Avogadro 
University of Eastern Piedmont, Novara, Italy and by the Oncology Institute 
of Southern Switzerland (lOSI), BeUinzona, Switzerland. The study was ap- 
proved by the Institutional Review Board of Columbia University and lOSI 
and by the Ethical Committee of the Azienda Ospedaliera Maggiore deUa 
Carita di Novara, Amedeo Avogadro University of Eastern Piedmont. 

DNA extraction. High molecular-weight genomic DNA was extracted from 
samples according to standard procedures, quantitated by the Quant-iT Pico- 
Green reagent (Invitrogen) and verified for integrity by gel electrophoresis. 
Clonahty of tumor samples was established by ampUfication of the rearranged 
Ig genes. The percentage of tumor cells of CLL and RSII samples was esti- 
mated by cytofluorometry analysis of CD5"'"/CD19"^ cells in the peripheral 
blood, and the one of RS specimens by immunohistochemistry analysis of 
CD20"^/CD79a"^ cells in the correspondent RS section. In all phases, the per- 
centage of tumor cells was then corrected based on the inferred CN value ob- 
served at the clonally rearranged Igloci (i.e., the segment of intrachromosomal 
deletional recombination observed at the 14q32.33, 2pll.2, and 22qll.2 loci; 
Bergsagel and Kuehl, 2013). AH analyzed cases had at least 35% of tumor cell 
content in the specimen used for WES and CN analyses (Table SI). 

FISH and IGHV mutation analysis. The molecular characterization of 
RS cases, including FISH analysis and assessment of /Gl/K mutational status, 
was performed as previously described (Fabbri et al., 2011; Rossi et al., 
2011b). In particular, the following probes were used for FISH analysis of 
the CLL-RS discovery pairs: (1) LSI13, LSID13S319, CEP12, LSIp53, 
LSIATM, LSI IGH/BCL2, LSI BCL6, LSI IGH/c-MYC/CEP8, c-MYC 
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break-apart, LSI N-MYC (Abbott); (2) BCL3 spht signal (Dako); (3) 6q21/ 
a-satellite (Kreatech Biotechnology); and (4) bacterial artificial chromosome 
clones 373L24-rel and 440P05-BCL1 lA. 

Whole-exome capture and massively parallel sequencing, sequence 
mapping, and identification of tumor-specific variants. Purified high 
molecular weight genomic DNA ('^1.5 )-ig) was enriched in protein-coding 
sequences using the SureSelect 50 Mb AH Exon kit for the discovery cases and 
the SureSelect Human AH Exon v4— 51Mb for the screening cases (Agilent 
Technologies), following standard protocols. The resulting target-enriched 
pool was amplified and subjected to paired-end sequencing (2 x 100 bp) on 
the HiSeq2000 System. Exome capture and sequencing procedures were per- 
formed at St. Jude Children's Research Hospital, Fasteris, and CentriUion 
Biosciences. The mean depth of coverage for aUgned reads was 83X (range, 
34-158X), with ^^75% of the target sequence being covered by at least 30 
reads (range, 48—89%). Based on a comparison with the heterozygous SNP call 
rate obtained in the same cases by analysis of Aflymetrix SNP 6.0 array data 
(available for 20/21 samples), the sensitivity for the identification of heterozy- 
gous somatic mutations was estimated to be '^97% (range, 94—99%). After fil- 
tering for duplicate reads (defined as reads with identical start and orientation), 
sequencing reads were mapped to the reference genome hgl9 assembly 
(GRCh37) using the Burrows- Wheeler Aligner (BWA) alignment tool ver- 
sion 0.5.9 (Li and Durbin, 2010). Sequence variants (i.e., nucleotide substitu- 
tions and small insertions/ deletions) were obtained using the SAVI (Statistical 
Algorithm for Variant Identification) algorithm independently for each sample 
(Tiacci et al., 2011). In brief, we constructed empirical priors for the distribu- 
tion of variant frequencies in each sample independendy and obtained high- 
credibihty intervals (posterior probability >1— 10~^) for corresponding changes 
in frequency between any of the analyzed samples. Variants were considered 
absent when observed with a frequency between 0 and 2%, and present when 
showing a frequency >3%. Given a mean depth of coverage for aligned reads 
>80X across samples, we chose 3% as the lower threshold for detecting vari- 
ants, considering a 1% error rate for the technology and a binomial distribution 
for the errors. Candidate variants were then filtered for (a) systematic errors 
known to be associated with the lUumina sequencing technology, (b) variants 
observed in only one strand, and (c) variants mapping to multiple loci in the 
genome, which may reflect captured pseudogenes and regions of low com- 
plexity. To this end, each variant with a flanking 35-base context sequence 
around its genomic position was mapped to the hgl9 reference using the 
BLAST algorithm, and only variant reads with "unique mappabiUty" were re- 
tained, that is, we required the 71-base sequence to uniquely map to the refer- 
ence genome with only one mismatch. Candidate tumor-specific nonsilent 
variants were then obtained by subtracting (a) known polymorphisms reported 
in the NCBI dbSNP database (BuHd 137); (b) variants present in any one of 
105 exomes from unaffected individuals analyzed at our institution; and (c) si- 
lent mutations. In addition, splice site variants were kept only if located within 
4 nt of a consensus sphce site. 

As the percentage of tumoral cells present in die analyzed specimens was 
not homogeneous across samples (Table SI), we corrected the variants' fre- 
quencies by calculating the expected frequency based on the total depth of the 
variant, its observed frequency, and the tumor content of the sample, assuming 
a binomial distribution. Mutations were classified as clonal if the fraction of 
variant reads was >20 (upon correction for the percentage of tumor cells in the 
sample), and subclonal otherwise. 

The genomic positions of the candidate variants were converted to the 
hgl8 coordinates using the LiftOver tool (UCSC Genome Browser) to be con- 
sistent with the annotation used to report CN aberrations. 

For the purpose of deciphering the history of clonal evolution during 
CLL transformation to RS, CLL-only variants were defined as events clonally 
represented in CLL and absent or subclonal in RS and/ or RSII and RS-only 
variants included events clonally represented in RS and absent/subclonal in 
CLL or subclonal in RS and absent in CLL. The same criteria were appHed to 
identify RSII-only variants. To establish whether the difference in representa- 
tion of the variants in distinct disease phases was statistically significant, we 
calculated the probability of observing the variant's depth, given its position 
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total depth, its observed frequency in the paired sample, and the estimates for 
samples' tumor contents, using a binomial distribution. Variants with p-values 
>0.05 were discarded. 

Previously reported genetic lesions detected in the same cases by Sanger 
sequencing {TP53, NOTCHl, and SF3B1) were all correctly identified by our 
approach (Table SI). Only one 50-bp duplication affecting the PEST domain 
of NOTCHl was missed by WES, consistent with the lower sensitivity of our 
mutation discovery algorithm in detecting large insertions and deletions. A set 
of candidate RS-specific nonsilent somatic mutations were subjected to valida- 
tion by conventional Sanger-based resequencing analysis of PGR products ob- 
tained from tumor DNA using primers specific for the exon encompassing the 
variant, as previously described (Fabbri et al., 2011; Pasqualucci et al., 2011). 
The vahdatioii rate of a subset of the identified point mutations tested by 
Sanger sequencing was ^94% {ti = 133/141). 

Given the absence of paired nomial DNAs, SAVI was used to interrogate 
a specific Hst of target genes (« = 481) in the WES-screening panel, as de- 
scribed in the section "Recurrent targets of point mutations in RS." 

CN analysis by high-density SNP array analysis. Genome-wide DNA 
profiles from 50 RS samples, of which 14 with paired CLL, including 8 out of 
9 discovery RS cases, 3 RS-leukemic phases, and a series of normal samples 
(rt = 7) were obtained from high molecular weight genomic DNA using the Af- 
fymetrix Genome- Wide Human SNP Array 6.0 (Affymetrix) and standard pro- 
tocols fi-om Affymetrix as previously described (Fabbri et al., 2011; Pasqualucci 
et al., 201 1). Previously reported genetic lesions detected by FISH-based anal- 
ysis (when present in >25% of nuclei) were all correctly identified by our ap- 
proach (Table SI). Data analysis, including the identification of Minimal 
Common Regions of aberration and the Genomic Identification of Signifi- 
cant Targets in Cancer (GISTIC), was performed as previously described 
(Fabbri et al., 2011; Pasqualucci et al., 2011). In brief, we focused on focal 
MCRs encompassing <3 protein- coding genes and not determined by the 
overlap between large segments (i.e., >1 Mb; Pasqualucci et al., 2011). Re- 
current focal MCRs are shown in Fig. 4 b and significant regions of loss and 
gain as assessed by GISTIC are reported in Fig. 4 d. 

Identification of significant association and mutual exclusion be- 
tween RS-associated genetic lesions. To determine the independence of 
any two genetic events occurring in the RS cases, we counted the number of 
samples involved in both events and calculated a p-value according to the hy- 
pergeometric distribution. If the number was significantly larger than expected, 
the two events were considered comutated, whereas if the number was signifi- 
cantly smaller they were considered mutually exclusive. We used a p-value 
threshold of 0.05 to assess the significance of both mutual-exclusion and comu- 
tation. Due to the limited number of samples, we did not use multi-test correla- 
tion in this experiment. 

Functional categories and pathways analyses of the mutated genes. 

Genes found to be altered in RS were assigned to functional categories or 
to annotated pathways using the gene ontology GO database (http://www 
.geneontology.org/), the publicly available bioinfonnatic tool DAVID 2008 6.7 
(Database for Annotation, Visualization and Integrated Discovery, http:/ /david 
.abcc.ncifcrf gov/), and the Molecular Signatures Database from the Broad Insti- 
tute (MSigDBv4.0, http://www.broadinstitute.org/gsea/nisigdb/index.jsp). In 
brief, the functional annotation chart tool of DAVID 2008 was used to identify 
enriched functional categories associated with RS-altered genes (including the 
"GOTERM_BP_ALL" and the "Pathways" terms); the Compute Overlap 
tool of the Molecular Signatures Database was used to examine how the gene 
set composed by RS-altered genes overlapped with gene sets belonging to the 
C2-Canonical pathways collection. DAVID 2008 categories and MSigDB 
overlaps were then selected to retain only those with a P value <0.05. 

Accession nos. Affymetrix SNP Array 6.0 data and WES data have been 
deposited in dbGaP under accession no. phs000364.v2.pl. 

Online supplemental material. Table SI shows clinical and biologi- 
cal features of the 9 CLL-RS discovery cases; Table S2 reports mutations 



identified by WES in the discovery and screening panels; and Table S3 
illustrates the segments (regions) of tumor-acquired CN alterations identi- 
fied in the discovery panel and in additional six paired and clonally related 
CLL-RS pairs. Online supplemental material is available at http://www.jem 
.org/cgi/content/full/jem.20131448/DCl. 
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